PyDigger - unearthing stuff about Python


NameVersionSummarydate
mineru 2.1.4 A practical tool for converting PDF to Markdown 2025-07-23 07:53:18
linkrot 5.2.2 Extract metadata and URLs from PDF files 2025-07-22 18:53:37
BrazilFiscalReport 0.5.33 Python library for generating Brazilian auxiliary fiscal documents in PDF from XML documents. 2025-07-22 17:36:15
docling 2.42.1 SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications. 2025-07-22 16:47:03
asposepdfcloud 25.7.0 Aspose.PDF Cloud 2025-07-22 16:44:38
ocr-document-converter 3.1.0 Enterprise-grade OCR and document conversion tool with dual OCR engines 2025-07-22 15:19:03
markitdown-pdf-separators 0.4.1 MarkItDown with PDF page separators - convert PDFs to Markdown with page boundary markers 2025-07-22 14:44:59
product-connections-manager 1.0.1 A comprehensive platform for managing Product Connections operations with automated EDR printing 2025-07-22 09:43:37
exparso 0.0.3 Analyzing and parsing documents 2025-07-22 09:43:16
simulchip 0.2.1 Compare NetrunnerDB decklists against local card collection and generate PDF proxies 2025-07-22 05:38:57
codex 1.8.4 A comic archive web server. 2025-07-21 23:40:03
comicbox 2.0.1 Comic book archive multi format metadata read/write/transform tool and image extractor. 2025-07-21 16:13:39
llm-data-converter 2.1.6 Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract 2025-07-21 12:19:10
txt2ebook 0.1.149 CLI tool to convert txt file to ebook format 2025-07-20 08:44:12
wdoc 3.3.0 A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, oe any combination!) 2025-07-19 12:44:46
gs-pdf-compress 0.2.1 Compress PDF files with Ghostscript 2025-07-18 23:29:09
pdfix-sdk 8.7.2 PDFix SDK - Automated PDF Remediation, Data Extraction, HTML Conversion 2025-07-18 06:51:27
pdf-tools-mcp 0.1.3 A FastMCP-based PDF reading and manipulation tool server 2025-07-18 03:32:46
o7pdf 1.0.1 PDf Reports 2025-07-17 13:24:06
txtify 0.1.2 A versatile Python tool to convert documents (PPTX, DOCX, PDF, XLSX) to plain text, ideal for providing context to AI code assistants like GitHub Copilot and Amazon CodeWhisperer. 2025-07-17 03:24:47
hourdayweektotal
93227410461300952
Elapsed time: 3.94407s